SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: 

Seidel, Christoph; Weinhues, Ursula-Henrike; 
Schmitt, Urban; Motz, Manfred; Wiedmann, Michael; 
Upmeier, Barbara; Soutscheck, Erwin 






(ii) TITLE OF INVENTION:

Recombinant antigen from the NS3 region of the hepatitis C virus 

(iii) NUMBER OF SEQUENCES: 9 

(iv) CORRESPONDENCE ADDRESS: 



(A) ADDRESSEE: Felfe & Lynch

(B) STREET: 805 Third Avenue

(C) CITY: New York

(D) STATE: New York

(E) COUNTRY: USA

(F) ZIP: 10022



(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 3.5 inch 1.44 Mb storage diskette

(B) COMPUTER: IBM PS/2

(C) OPERATING SYSTEM: PC-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPA)



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/511,759 

(B) FILING DATE: 7-AUGUST-1995

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: P 44 28 705.4

(B) FILING DATE: 12 AUGUST 1994 

(viii) ATTORNEY/AGENT INFORMATION 

(A) NAME: HANSON, NORMAN 

(B) REGISTRATION NUMBER: 30,946

(C) REFERENCE/DOCKET NUMBER: HUBR 1067.1 

(ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: 212-688-9200 

(B) TELEFAX: 212-838-3884 






(2) INFORMATION FOR SEQ ID NO: 1:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) INITIAL ORIGIN:

(A) ORGANISM: hepatitis C virus

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT: NS3

(ix) CHARACTERISTICS:

(A) NAME/KEY: CDS

(B) LOCATION: 1..885

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATG ACC ATG ATT ACG AAT TCC CGG GGA TCC ATC ATG AAA TCC CCG GTG  48
Met Thr Met Ile Thr Asn Ser Arg Gly Ser Ile Met Lys Ser Pro Val
1               5                   10                  15

TTC ACG GAT AAC TCC TCT CCA CCG GTA GTG CCC CAG AGC TTC CAG GTG  96
Phe Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gln Ser Phe Gln Val
            20                  25                  30

GCT CAC CTG CAT GCT CCC ACA GGC AGC GGC AAG AGC ACC AAG GTC CCG  144
Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro
        35                  40                  45

GCT GCA TAC GCA GCT CAG GGC TAC AAG GTG CTA GTG CTC AAC CCT TCT  192
Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser
    50                  55                  60

GTT GCT GCA ACA TTG GGC TTT GGT GCC TAC ATG TCC AAG GCT CAT GGG  240
Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly
65                  70                  75                  80

ATC GAT CCT AAC ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT GGC AGC  288
Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser
                85                  90                  95

CCC ATT ACG TAC TCC ACT TAC GGC AAG TTT CTT GCC GAC GGC GGG TGC  336
Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys
            100                 105                 110

GCA GGG GGT GCT TAT GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC ACG  384
Ala Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr
        115                 120                 125

GAT GCC ACA TCC ATC TTG GGC ATC GGC ACT GTC CTT GAC CAA GGA GAG  432
Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Gly Glu
    130                 135                 140

ACT GCG GGG GCG AAA TTG GTT GTG TTC GCC ACC GCC ACC CCT CCG GGC  480
Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr Ala Thr Pro Pro Gly
145                 150                 155                 160

TCC GTC ACT GTG CCC CAT CCC AAC ATT GAG GAG GTT GCT CTA TCC ACC  528
Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr
                165                 170                 175

ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTT GAG GTA ATC  576
Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile
            180                 185                 190

AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AGG AAG TGC GAT  624
Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Arg Lys Cys Asp
        195                 200                 205

GAG CTC GCC ACA AAG CTG GTC GCA ATG GGC ATC AAT GCC GTG GCC TAC  672
Glu Leu Ala Thr Lys Leu Val Ala Met Gly Ile Asn Ala Val Ala Tyr
    210                 215                 220

TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGT GAT GTT GTC  720
Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val
225                 230                 235                 240

GTC GTG GCA ACC GAC GCC CTC ATG ACC GGC TAT ACC GGC GAC TTC GAC  768
Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp
                245                 250                 255

TCG GTG ATA GAC TGC AAC ACG TGT GTC ACT CAG ACA GTC GAT TTC AGC  816
Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser
            260                 265                 270

CTT GAC CCT ACC TTC ACC ATT GAG ACG ACC ACA CTT CCC CAG GAT GCT  864
Leu Asp Pro Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala
        275                 280                 285

GTC TCC CGC ACT CAA CGA CGG  885
Val Ser Arg Thr Gln Arg Arg
    290                 295
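The CDS feature (LOCATION 1..885) and the 295-residue translation printed under it can be cross-checked mechanically. The Python sketch below is an illustrative check and not part of the filed listing. It assumes the standard genetic code applies, and the names `CDS_ROWS`, `CODON_TABLE`, and `translate` are hypothetical helpers introduced only for this example.

```python
# Illustrative consistency check (not part of the filed listing).
# Assumption: the standard genetic code applies to this CDS.

# Codon rows of SEQ ID NO: 1 as printed in the listing (16 codons per row).
CDS_ROWS = [
    "ATG ACC ATG ATT ACG AAT TCC CGG GGA TCC ATC ATG AAA TCC CCG GTG",
    "TTC ACG GAT AAC TCC TCT CCA CCG GTA GTG CCC CAG AGC TTC CAG GTG",
    "GCT CAC CTG CAT GCT CCC ACA GGC AGC GGC AAG AGC ACC AAG GTC CCG",
    "GCT GCA TAC GCA GCT CAG GGC TAC AAG GTG CTA GTG CTC AAC CCT TCT",
    "GTT GCT GCA ACA TTG GGC TTT GGT GCC TAC ATG TCC AAG GCT CAT GGG",
    "ATC GAT CCT AAC ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT GGC AGC",
    "CCC ATT ACG TAC TCC ACT TAC GGC AAG TTT CTT GCC GAC GGC GGG TGC",
    "GCA GGG GGT GCT TAT GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC ACG",
    "GAT GCC ACA TCC ATC TTG GGC ATC GGC ACT GTC CTT GAC CAA GGA GAG",
    "ACT GCG GGG GCG AAA TTG GTT GTG TTC GCC ACC GCC ACC CCT CCG GGC",
    "TCC GTC ACT GTG CCC CAT CCC AAC ATT GAG GAG GTT GCT CTA TCC ACC",
    "ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTT GAG GTA ATC",
    "AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AGG AAG TGC GAT",
    "GAG CTC GCC ACA AAG CTG GTC GCA ATG GGC ATC AAT GCC GTG GCC TAC",
    "TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGT GAT GTT GTC",
    "GTC GTG GCA ACC GAC GCC CTC ATG ACC GGC TAT ACC GGC GAC TTC GAC",
    "TCG GTG ATA GAC TGC AAC ACG TGT GTC ACT CAG ACA GTC GAT TTC AGC",
    "CTT GAC CCT ACC TTC ACC ATT GAG ACG ACC ACA CTT CCC CAG GAT GCT",
    "GTC TCC CGC ACT CAA CGA CGG",
]

# Standard genetic code, built from the conventional TCAG base ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding strand into one-letter amino acid codes."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Join the rows into one continuous coding strand, dropping the spaces.
cds = "".join("".join(row.split()) for row in CDS_ROWS)
protein = translate(cds)

print(len(cds), len(protein))  # nucleotide count and residue count
print(protein[:16])            # first listing row in one-letter code
```

With the rows transcribed as above, the counts agree with the LENGTH fields of SEQ ID NO: 1 (885 base pairs) and SEQ ID NO: 2 (295 amino acids), and no stop codon occurs inside the reading frame.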



(3) INFORMATION FOR SEQ ID NO: 2:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Thr Met Ile Thr Asn Ser Arg Gly Ser Ile Met Lys Ser Pro Val
1               5                   10                  15

Phe Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gln Ser Phe Gln Val
            20                  25                  30

Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro
        35                  40                  45

Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser
    50                  55                  60

Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly
65                  70                  75                  80

Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser
                85                  90                  95

Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys
            100                 105                 110

Ala Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr
        115                 120                 125

Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Gly Glu
    130                 135                 140

Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr Ala Thr Pro Pro Gly
145                 150                 155                 160

Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr
                165                 170                 175

Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile
            180                 185                 190

Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Arg Lys Cys Asp
        195                 200                 205

Glu Leu Ala Thr Lys Leu Val Ala Met Gly Ile Asn Ala Val Ala Tyr
    210                 215                 220

Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val
225                 230                 235                 240

Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp
                245                 250                 255

Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser
            260                 265                 270

Leu Asp Pro Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala
        275                 280                 285

Val Ser Arg Thr Gln Arg Arg
    290                 295



(4) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

AAGGGATCCA TCATGAAATC CCCGGTGTTC ACGGATAACT 40



(5) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GGGAAGCCTT AATTCTTACC GTCGTTGAGT GCGGGAGAC 
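
The two oligonucleotides SEQ ID NO: 3 and SEQ ID NO: 4 can be compared against the ends of the SEQ ID NO: 1 coding strand. The listing itself does not state how these sequences relate to one another, so the Python sketch below only compares strings and draws no conclusion about their use. All names (`SEQ_ID_1_START`, `reverse_complement`, and so on) are hypothetical and introduced for this example; the two SEQ ID NO: 1 fragments are copied from nucleotides 1 to 61 and 865 to 885 of the listing.

```python
# Illustrative string comparison (not part of the filed listing).
# All names below are hypothetical helpers introduced for this sketch.

SEQ_ID_3 = "AAGGGATCCA TCATGAAATC CCCGGTGTTC ACGGATAACT".replace(" ", "")
SEQ_ID_4 = "GGGAAGCCTT AATTCTTACC GTCGTTGAGT GCGGGAGAC".replace(" ", "")

# Nucleotides 1-61 and 865-885 of SEQ ID NO: 1, copied from the listing.
SEQ_ID_1_START = (
    "ATG ACC ATG ATT ACG AAT TCC CGG GGA TCC ATC ATG AAA TCC CCG GTG "
    "TTC ACG GAT AAC T"
).replace(" ", "")
SEQ_ID_1_END = "GTC TCC CGC ACT CAA CGA CGG".replace(" ", "")

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# SEQ ID NO: 3 without its first three nucleotides, compared with
# positions 25-61 of the SEQ ID NO: 1 coding strand.
forward_matches = SEQ_ID_3[3:] == SEQ_ID_1_START[24:]

# Reverse complement of SEQ ID NO: 4, compared with the last
# 21 nucleotides of the SEQ ID NO: 1 coding strand.
rc4 = reverse_complement(SEQ_ID_4)
reverse_matches = rc4.startswith(SEQ_ID_1_END)

print(forward_matches, reverse_matches)
print(rc4[len(SEQ_ID_1_END):])  # nucleotides that follow the coding region
```

In this comparison both checks succeed, and the first three nucleotides after the coding region in the reverse complement of SEQ ID NO: 4 read TAA.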



(6) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAGGGATCCA TCATGAAAGC GGTGGACTTT ATCCCTGTG 



(7) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GAGAAGCTTT TAACACGTGT TGCAGTCTAT CAC

(8) INFORMATION FOR SEQ ID NO: 7:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GAGGGATCCA TCATGAAACA CCTGCATGCT CCCACCGGC



(9) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

GAGAAGCTTT TAATACCAAG CACAGCCTGC GTC 33

(10) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 302 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Pro Val Glu Asn Leu Glu Thr Thr Met Arg Ser Pro Val Phe Thr
1               5                   10                  15

Asp Asn Ser Ser Pro Pro Val Val Pro Gln Ser Phe Gln Val Ala
                20                  25                  30

His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro
                35                  40                  45

Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro
                50                  55                  60

Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala
                65                  70                  75

His Gly Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr
                80                  85                  90

Thr Gly Ser Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala
                95                  100                 105

Asp Gly Gly Cys Ala Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp
                110                 115                 120

Glu Cys His Ser Thr Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr
                125                 130                 135

Val Leu Asp Gln Gly Glu Thr Ala Gly Ala Lys Leu Val Val Phe
                140                 145                 150

Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn
                155                 160                 165

Ile Glu Glu Val Ala Leu Ser Thr Thr Gly Glu Ile Pro Phe Tyr
                170                 175                 180

Gly Lys Ala Ile Pro Leu Glu Val Ile Lys Gly Gly Arg His Leu
                185                 190                 195

Ile Phe Cys His Ser Lys Arg Lys Cys Asp Glu Leu Ala Thr Lys
                200                 205                 210

Leu Val Ala Met Gly Ile Asn Ala Val Ala Tyr Tyr Arg Gly Leu
                215                 220                 225

Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val Val Val Ala
                230                 235                 240

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val
                245                 250                 255

Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser Leu
                260                 265                 270

Asp Pro Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala
                275                 280                 285

Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro
                290                 295                 300

Gly Ile
    302
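
SEQ ID NO: 2 and SEQ ID NO: 9 share most of their residues, and the extent of the overlap can be measured directly. The Python sketch below is an illustrative comparison and not part of the filed listing. The one-letter strings are a transcription of the three-letter rows above, and `SEQ_ID_2`, `SEQ_ID_9`, `matcher`, and `shared` are hypothetical names. It uses `difflib.SequenceMatcher` from the standard library with `autojunk=False`, so that frequent residues are not discarded by the junk heuristic.

```python
# Illustrative comparison (not part of the filed listing).
# The one-letter strings transcribe the three-letter rows of the listing.
from difflib import SequenceMatcher

SEQ_ID_2 = "".join([
    "MTMITNSRGSIMKSPV", "FTDNSSPPVVPQSFQV", "AHLHAPTGSGKSTKVP",
    "AAYAAQGYKVLVLNPS", "VAATLGFGAYMSKAHG", "IDPNIRTGVRTITTGS",
    "PITYSTYGKFLADGGC", "AGGAYDIIICDECHST", "DATSILGIGTVLDQGE",
    "TAGAKLVVFATATPPG", "SVTVPHPNIEEVALST", "TGEIPFYGKAIPLEVI",
    "KGGRHLIFCHSKRKCD", "ELATKLVAMGINAVAY", "YRGLDVSVIPTSGDVV",
    "VVATDALMTGYTGDFD", "SVIDCNTCVTQTVDFS", "LDPTFTIETTTLPQDA",
    "VSRTQRR",
])

SEQ_ID_9 = "".join([
    "PVENLETTMRSPVFT", "DNSSPPVVPQSFQVA", "HLHAPTGSGKSTKVP",
    "AAYAAQGYKVLVLNP", "SVAATLGFGAYMSKA", "HGIDPNIRTGVRTIT",
    "TGSPITYSTYGKFLA", "DGGCAGGAYDIIICD", "ECHSTDATSILGIGT",
    "VLDQGETAGAKLVVF", "ATATPPGSVTVPHPN", "IEEVALSTTGEIPFY",
    "GKAIPLEVIKGGRHL", "IFCHSKRKCDELATK", "LVAMGINAVAYYRGL",
    "DVSVIPTSGDVVVVA", "TDALMTGYTGDFDSV", "IDCNTCVTQTVDFSL",
    "DPTFTIETTTLPQDA", "VSRTQRRGRTGRGKP", "GI",
])

# Find the longest contiguous block of residues common to both sequences.
matcher = SequenceMatcher(None, SEQ_ID_2, SEQ_ID_9, autojunk=False)
match = matcher.find_longest_match(0, len(SEQ_ID_2), 0, len(SEQ_ID_9))
shared = SEQ_ID_2[match.a:match.a + match.size]

# Report 1-based residue positions to match the numbering in the listing.
print("SEQ ID NO: 2 residues", match.a + 1, "to", match.a + match.size)
print("SEQ ID NO: 9 residues", match.b + 1, "to", match.b + match.size)
print("shared length:", match.size)
```

On this transcription the shared block runs from residue 14 to 295 of SEQ ID NO: 2 and from residue 11 to 292 of SEQ ID NO: 9, so the two sequences differ only in their N-terminal residues and in the ten C-terminal residues of SEQ ID NO: 9.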






SEQUENCE PROTOCOL 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Boehringer Mannheim GmbH 

(B) ROAD: Sandhofer Str. 112-132 

(C) CITY: Mannheim 

(E) COUNTRY: Germany 

(F) POSTAL CODE: 68305 

(ii) TITLE OF APPLICATION: Recombinant antigen from 

the NS3 region of the hepatitis C virus 

(iii) NUMBER OF SEQUENCES: 8 

(iv) COMPUTER READABLE FORM: 

(A) DATA CARRIER: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, 

Version #1.25 (EPA) 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: both 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 




(vi) INITIAL ORIGIN: 

(A) ORGANISM: hepatitis C virus 

(viii) POSITION IN THE GENOME: 

(A) CHROMOSOME/SEGMENT: NS3

(ix) CHARACTERISTICS:

(A) NAME/KEY: CDS

(B) LOCATION: 1..885 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



ATG ACC ATG ATT ACG AAT TCC CGG GGA TCC ATC ATG AAA TCC CCG GTG  48
Met Thr Met Ile Thr Asn Ser Arg Gly Ser Ile Met Lys Ser Pro Val
1               5                   10                  15

TTC ACG GAT AAC TCC TCT CCA CCG GTA GTG CCC CAG AGC TTC CAG GTG  96
Phe Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gln Ser Phe Gln Val
            20                  25                  30

GCT CAC CTG CAT GCT CCC ACA GGC AGC GGC AAG AGC ACC AAG GTC CCG  144
Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro
        35                  40                  45

GCT GCA TAC GCA GCT CAG GGC TAC AAG GTG CTA GTG CTC AAC CCT TCT  192
Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser
    50                  55                  60

GTT GCT GCA ACA TTG GGC TTT GGT GCC TAC ATG TCC AAG GCT CAT GGG  240
Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly
65                  70                  75                  80

ATC GAT CCT AAC ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT GGC AGC  288
Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser
                85                  90                  95

CCC ATT ACG TAC TCC ACT TAC GGC AAG TTT CTT GCC GAC GGC GGG TGC  336
Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys
            100                 105                 110

GCA GGG GGT GCT TAT GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC ACG  384
Ala Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr
        115                 120                 125

GAT GCC ACA TCC ATC TTG GGC ATC GGC ACT GTC CTT GAC CAA GGA GAG  432
Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Gly Glu
    130                 135                 140

ACT GCG GGG GCG AAA TTG GTT GTG TTC GCC ACC GCC ACC CCT CCG GGC  480
Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr Ala Thr Pro Pro Gly
145                 150                 155                 160

TCC GTC ACT GTG CCC CAT CCC AAC ATT GAG GAG GTT GCT CTA TCC ACC  528
Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr
                165                 170                 175

ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTT GAG GTA ATC  576
Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile
            180                 185                 190

AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AGG AAG TGC GAT  624
Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Arg Lys Cys Asp
        195                 200                 205

GAG CTC GCC ACA AAG CTG GTC GCA ATG GGC ATC AAT GCC GTG GCC TAC  672
Glu Leu Ala Thr Lys Leu Val Ala Met Gly Ile Asn Ala Val Ala Tyr
    210                 215                 220

TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGT GAT GTT GTC  720
Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val
225                 230                 235                 240

GTC GTG GCA ACC GAC GCC CTC ATG ACC GGC TAT ACC GGC GAC TTC GAC  768
Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp
                245                 250                 255

TCG GTG ATA GAC TGC AAC ACG TGT GTC ACT CAG ACA GTC GAT TTC AGC  816
Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser
            260                 265                 270

CTT GAC CCT ACC TTC ACC ATT GAG ACG ACC ACA CTT CCC CAG GAT GCT  864
Leu Asp Pro Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala
        275                 280                 285

GTC TCC CGC ACT CAA CGA CGG  885
Val Ser Arg Thr Gln Arg Arg
    290                 295



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Met Ile Thr Asn Ser Arg Gly Ser Ile Met Lys Ser Pro Val
1               5                   10                  15

Phe Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gln Ser Phe Gln Val
            20                  25                  30

Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro
        35                  40                  45

Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser
    50                  55                  60

Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly
65                  70                  75                  80

Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly Ser
                85                  90                  95

Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys
            100                 105                 110

Ala Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser Thr
        115                 120                 125

Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Gly Glu
    130                 135                 140

Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr Ala Thr Pro Pro Gly
145                 150                 155                 160

Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser Thr
                165                 170                 175

Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile
            180                 185                 190

Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Arg Lys Cys Asp
        195                 200                 205

Glu Leu Ala Thr Lys Leu Val Ala Met Gly Ile Asn Ala Val Ala Tyr
    210                 215                 220

Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val Val
225                 230                 235                 240

Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp
                245                 250                 255

Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe Ser
            260                 265                 270

Leu Asp Pro Thr Phe Thr Ile Glu Thr Thr Thr Leu Pro Gln Asp Ala
        275                 280                 285

Val Ser Arg Thr Gln Arg Arg
    290                 295


(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

AAGGGATCCA TCATGAAATC CCCGGTGTTC ACGGATAACT 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

GGGAAGCCTT AATTCTTACC GTCGTTGAGT GCGGGAGAC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single



(D) TOPOLOGY: linear 



(ii) TYPE OF MOLECULE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

GAGGGATCCA TCATGAAAGC GGTGGACTTT ATCCCTGTG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

GAGAAGCTTT TAACACGTGT TGCAGTCTAT CAC 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
GAGGGATCCA TCATGAAACA CCTGCATGCT CCCACCGGC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

GAGAAGCTTT TAATACCAAG CACAGCCTGC GTC 



